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ABSTRACT 

Two different multivariate clustering techniques, the K-means partitioning method and the Dirichlet process 
of mixture modeling, have been applied to the BATSE Gamma-ray burst (GRB) catalog, to obtain the optimum 
number of coherent groups. In the standard paradigm, GRB are classified in only two groups, the long and 
short bursts. However, for both the clustering techniques, the optimal number of classes was found to be three, 
a result which is consistent with previous statistical analysis. In this classification, the long bursts are further 
divided into two groups which are primarily differentiated by their total fluence and duration and hence are 
named low and high fluence GRB. Analysis of GRB with known red-shifts and spectral parameters suggests 
that low fluence GRB have nearly constant isotropic energy output of 10 52 ergs while for the high fluence 
ones, the energy output ranges from 10 52 to 10 54 ergs. It is speculated that the three kinds of GRBs reflect 
three different origins: mergers of neutron star systems, mergers between white dwarfs and neutron stars, and 
collapse of massive stars 

Subject headings: Gamma Rays: Bursts - Methods: Data Analysis - Methods: Statistical 
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1. INTRODUCTION 

Although it has now been well established that Gamma- 
Ray Bursts (GRB) are of cosmological origin, their nature 
and source still remains a mystery. Detailed observations 
and studies of their afterglow emission have revealed impor- 
tant information regarding the dynami c features an d environ- 
ments of these explosive events (see iPiranl d2005h for a re- 
view). The detection of supernova light curve in the after- 
glows of long duration nearby GRB has indicated that a frac- 
tion of the GRB occur dur i ng the the collapse of a massive 
star (see lWooslev & Bloom (2006) for a review). Other mech- 
anism that could produce GRB are the merger of compact ob- 
jects like a pair of neutron stars or a neutron star with a blac k 
hole (e.g. lPiranlll992l:lGehrels et alj2005l: bloom et al.ll2006h . 
Thus GRB may be a heterogeneous group and a proper clas- 
sification of the phenomena is crucial to isolate and identify 
the possible different sources. Such a classification will also 
enable the identification of spectral or temporal correlations 
which may exist only for a particular class of GRB. 

In general, GRB have been classified into two groups of 
long (> 2 sec) and short (< 2 sec) duration bursts. This is 
based on visual inspection of the distribution of burst dura- 
tion which clearly shows two peaks. Theoretically, this may 
be understood by identifying long bursts with collapsing stars 
where the duration of the event is linked to the dynamical col- 
lapse time-scale. On the other hand, the merger of two neu- 
tron stars should occur on short timescales and hence may 
correspond to the short duration bursts. In this scenario, the 
long duration bursts should always be associated with a su- 
pernova explosion and occur in star burst regions, while short 
bursts should have no relation to star burst regions and should 
have no associated supernova. However, theoretically an- 
other mechanism to create a GRB could be the merger of 
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neutron stars with white dwarfs. These would be long du- 
ration bursts, with no associated supernova and could be a 
signif icant fraction of the total observed bursts dKing et alJ 
120071) . Observationally, there have been some evidence that 
there may be more than two classes of GRB. Some GRB have 
been recorded with low intrinsic lumino sity and do not com- 
ply with standard spec tral relationships ( Saz onov et al]l2"004t 
ISoderberg et alJl2004l) . More compelling evidence was the 
absence of a super-nova light curve in two nearby bursts, GRB 
060614 and 060505, which suggested that not all long dura- 
tion bursts are due to ma ssive stellar collapses (Gehre ls et alJ 
[20061: iFvnbo et al.ll2006h although for GRB 060 614 this re- 
sult has been disputed bv lSchaefer & Xiaol (|2006), who claim 
that this GRB is not a nearby one. While confirmation of these 
results are awaited, they do highlight the need to examine the 
possibility that there are more than two types of GRB. 

The Burst and Transient Source Experiment (BATSE) on 
board the COMPTON Gamma-Ray Observatory (CGRO) has 
provided spectral and temporal information for more than 
1500 GRB. Although BATSE provides several spectral pa- 
rameters, the bimodal distribution of GRB is based on uni- 
variate analy sis i.e. only duration is considered as a pa - 
rameter (e.g. iDezalav et al.1 119921 : iKouveliotou et alJ 11993b . 
There is a claim that even such a univariate anal ysis sup- 
ports the existence of three classes (Horvath~l ll998h . Classi- 
fication analysis taking into account more observed param- 
eters, i.e. a mult i variate analysis, was first undertaken by 
iFeigelson & Babul ([1998). Subsequently, different type of 
analysis were undertaken to classify GRB. Baumgart ( 1994) 
used a neural network technique while Ba golv et alJ (1998) 
have undertaken a factor analysis. Nonpara metric hierarchi- 
cal clu stering techniques have been used by Mukherie e et alJ 
( 1998) for 797 GRB with six variables and by Balastegui et al. 
(2001) for 1599 GRB with nine variables. The results were 
confir m ed for the complete B ATSE GRB catalog (IHorvath I 
|2002|) . lHakkila et alJ d2003|) used a unsupervi sed pattern 
recognition algorithm while IHorvath et all ((2006) have clas- 
sified GRB by fitting bivariate distributions to the observed 
duration and hardness ratios. In all these cases, the authors 
have claimed the existence of at least three classes of GRB 
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although it is not clear whether the different classifications 
found are consistent with each other, primarily because of the 
different techniques and the choice of different observed pa- 
rameters and data sets. 

These analyses are based on the observed properties of the 
GRB and are hence subject to observational biases. In fact, 
lHakkila et al. lf2000) argue that such classification techniques 
are significantly hampered and the three classes f ound is prob- 
ably due to such biases (see Horvat het alj (120061) for a counter 
argument). Indeed, a proper classification should be based 
on intrinsic rather than observed properties. However since 
the number of GRB with known red-shift and uniformly mea- 
sured temporal and spectral parameters is small, such an ex- 
ercise is presently not possible. Although it can be argued 
that certain correlations between observed parameters (e.g. 
the positive correlation between duration and flux) cannot be 
entirely due to observational bias, it is difficult to ascertain 
whether the quantitative relationships are not affected. A pru- 
dent approach may be to treat such classification as indica- 
tive of the nature of systems which should be corroborated by 
theoretical expectations and further observations. Since theo- 
retical expectations are on intrinsic properties, there is a need 
to estimate how these intrinsic properties cluster, given the 
results of a classification based on observed ones. The clas- 
sification itself should be tested for robustness using different 
types of schemes. The result of such an analysis may guide 
or be supported by theoretical models. Moreover, the results 
can lay down the broad framework and requirements of fu- 
ture observations which can confirm or rule out the proposed 
theoretical scenario. 

In this work, we use two different multivariate clustering 
techniques, the K-means partitioning method and the Dirich- 
let process of mixture modeling, to classify GRB based on 
their observed properties. These two schemes, which have 
not been used before for GRB, have the advantage that they 
do not follow any prior assumption about the number of ho- 
mogeneous classes. The optimum classification comes out 
of the process itself. The two schemes allow for post clas- 
sification discriminant analysis which can be used to verify 
the acceptability of the classification by computing classifica- 
tion/misclassification probabilities. More importantly, a GRB 
which is not in the original sample and which has only a sub- 
set of the observed properties used for the classification can be 
assigned a probability for it to be a member of a certain class. 
Thus although our analysis has been based on the BATSE cat- 
alog, GRB with known red-shifts (which have been observed 
by other instruments) can be assigned such probabilities and 
suitably classified. As we shall see, this not only provides 
qualitative estimation of any possible observational bias, but 
can constrain the intrinsic properties of the different clusters. 
We have selected 2 1 GRB with known redshifts and well con- 
strained spectral parameters for such classifications and ob- 
tained constrains on the cluster's average luminosity. 

In the next section, we briefly describe the two classification 
schemes and present the result of the analysis on the BATSE 
catalog. In §3, the classification obtained from the BATSE 
data are used to classify GRB with known red-shifts and in- 
ferences are made on the intrinsic properties of the different 
GRB groups. In §4 the work is summarized and the main 
results are discussed. 

2. CLUSTERING ANALYSIS FOR GRB DATA 

The BATSE catalog provides temporal and spectral infor- 
mation for more than 1500 GRB. The parameters include, 



two measures of burst durations, the times within which 50% 
(T50) and 90% (790) of the flux arrive, three peak fluxes, 
^64 j ^256 , Pi 024 measured in 64, 256 and 1024 ms bins respec- 
tively, four time integrated fluences F{ - F4, in the 20 - 50, 
50- 100, 100-300 and > 300 KeV spectral channels. Many 
of the param eters are highly correlated and following prev ious 
works (e.g. iMukheriee etail 119981: lHakkila et all GOOOl) we 
use the following six parameter set: logTso, log^o, logP256i 
logFj, log//32, log//32i, where Ft = F\ + is the 

total fluence while 7/32 = F3/F2 and H321 = Ft,/(F\ +F2) are 
measures of spectral hardness. The sample consists of 1594 
GRB that have non-zero detections of these parameters. We 
have not introduced any completeness criteria (like a lower 
flux cutoff), since incompleteness primarily affects the short 
duration bursts and hence is not expected to change the quali- 
tative results obtained. We retain the F4 flux (in the definition 
of Ft), despite the uncertainties in its calibration and sensitiv- 
ity, because as we discuss later in §4, the 7-ray flux > 300 keV 
is expected to have important spectral information. However, 
this fluence is not used in the computation of spectral hard- 
ness. 

2.1. Partitioning (K-means clustering) method 

Over the last several decades, different algorithms have 
been developed for Cluster Analysis which is used to find 
groups in a multivariate data set. The choice of a clustering 
algorithm depends both on the type of data available and on 
the particular purpose. Generally, clustering algorithms can 
be divided into two principal types viz. partitioning and hier- 
archical methods. A partitioning method constructs K clusters 
i.e. it classifies the data into K groups which together satisfy 
the requirement of a partition such that each group must con- 
tain at least one object and each object must belong to exactly 
one group. So there are at most as many groups as there are 
objects (K <= n). Two different clusters cannot have any ob- 
ject in common and the K groups together add up to the full 
data set. The aim is usually to uncover a structure that is al- 
ready present in the data. On the other hand, Hierarchical al- 
gorithms do not construct single partition with K clusters but 
they deal with all values of K in the same run. The extreme 
partitions with K = 1 (all objects are together in one clus- 
tered K — n (where each object forms a separate cluster) is 
a part of the output. In between all values of K = 2,3, ...«— 1 
are covered in a kind of gradual transition. The only differ- 
ence between K = r & K = r+l is that one of the r clusters 
splits in order to obtain r+l clusters or two of the (r+l) clus- 
ters combined to yield r clusters. In this method either one 
starts with K = n and move hierarchically downwards where 
at each step two clusters are merged depending on similarity 
until only one is left i.e. K = 1 (agglomerative) or the re- 
verse way where one starts with K = 1 and moves upwards 
where at each step one cluster is divided into two (depending 
on dissimilarity) until K = n (divisive). Mos t of the previ- 
ous works on GRB (e.g. IMukheriee et al.ll 19981: lHakkila et al.1 
20001 iBalastegui et al.ll200lT7 have been based on hierarchical 
clustering. However, for GRB classification, a partitioning 
method may be more applicable because (a) it tries to select 
the best clustering with K groups which is not the goal of a hi- 
erarchical method, (b) a hierarchical method can never repair 
what was done in previous steps and (c) partitioning meth- 
ods are designed to group items rather than variables into a 
collection of K clusters. 

In th is work, we apply the K- means method of Mac Oueenl 
( 1 19671) which is probably the most widely technique, to the 
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2 3 4 

No. of Clusters (K) 



TABLE 1 

Average Cluster properties based on K-means 
classification 



Parameters 



Cluster I 



Cluster II 



Cluster III 



T 50 (sec) 

r 90 (sec) 

P 2 56 (#/cm 2 /sec) 

Fj (x 1CT 6 ergs/sec) 

#32 

#321 



0.19±0.01 
0.50± 0.02 
1.66± 0.08 
0.62 ±0.04 
5.50± 0.13 
3.39± 0.08 



5.37± 0.25 
15.85±0.73 
1.26± 0.06 
2.34 ±0.11 
2.45± 0.06 
1.32± 0.03 



22.9± 1.0 
63. 1± 2.9 
2.88± 0.13 
17.8±0.8 
3.16± 0.07 
1.78±0.04 



NOTE. — Errors quoted are standard errors. The number of 
members are 423, 622 and 549 for Clusters I, II and III respectively. 



TABLE 2 

Discriminant Analysis for the K-means 
classification 



Cluster /* 
Cluster II* 
Cluster III" 



Cluster I Cluster II Cluster III 



417 
6 




28 
578 
21 



Q 

23 
526 



FIG. 1 . — The distortion curve d' K and the transformed jump curve Jk for 
different number of clusters. The largest value of Jk and the leveling of d' K 
indicates that for the K-means clustering technique the optimal number of 
clusters is three. 



BATSE catalog. For this method, t he optimum valu e of K 
can be obtained in different ways (Hartiganl 119751) . This 
is done by computing for each cluster formation ( i.e for 
number of clusters K = 2,3,4..) a distance measure cLk = 
(1 / p)min x E[(xK - ck) {xk~ck)\ which is defined as the dis- 
tance of the xk vector (values of the parameters) from the cen- 
ter of a cluster ck (which is estimated as mean value), p is the 
order of the xk vector, i.e. the number of parameters which for 
our case is six. If d' K is the estimate of die at the K th point, then 
d' K is the minimum achievable distortion associated with fit- 
ting K centers to the data. A natural way of choosing the num- 
ber of clusters is to plot d' K versus K and look for the result- 
ing distortion curve. This curve will monotonically decrease 
with increasing K, till K is greater than true number of clus- 
ters, after which the curve will level off with a smaller slope. 
This is expected since adding more clusters beyond the true 
numbe r, will simply create p artitions within a group. Accord- 
ing to lSugar & Jamesl d2003l) it is more illustrative to consider 
the transformation of the distortion curve to an appropriate 

negative power, Jk = (d' K - d' K _^^), which will exhibit a 
sharp "jump" when K equals the true number of clusters. The 
optimum number of clusters is the value of K at which the 
distortion curve levels off as well as its value associated with 
the largest jump for the transformed curve. 

Fig Q] shows the distortion curve and transformed jump 
curve for the analysis of the BATSE data. The leveling of the 
distortion curve and the shape of the jump function strongly 
suggests that the optimum number of clusters is greater than 
two and is likely to be three. The group means and the stan- 
dard errors for the six parameters for three cluster classifica- 
tion, are tabulated in Table [TJ Cluster I with 423 members 
has an average < 7/90 >^ 0.5 s can be clearly identified with 
the short duration bursts. The long duration bursts are cleanly 
separated into two clusters (clusters 2 and 3) with 622 and 549 



Total 423 622 549 



NOTE. — Clusters 1,11 and III, are the clusters 
obtained from the K-means classification. Clusters 
F ,11* and ///* are the clusters to which the GRB 
were assigned by the Discriminant analysis. 



members. 

Once the optimum classification (clustering) is obtained, 
using a process called Discrimination Analysis iJohnson I 
(1996), one can verify the acceptability of the classification 
by computing classification/misclassification probabilities for 
the different GRB. Although the K-means clustering method 
is purely a data analytic method, for classification it may be 
necessary to assume that the underlying distribution is Mul- 
tivariate Normal. In this standard procedure, using the prob- 
ability density functions in parameter space for the different 
clusters, one can assign an object (in this case a GRB) to be a 
member of a certain class. If the original classification was ro- 
bust, then every GRB should be classified again as a member 
of the same class that it was before. If a significant number 
of objects are not reclassified then that would mean that the 
original classification was not stable and hence not trustwor- 
thy. Table [2] show the result of a Discrimination Analysis, 
where the columns represent how the GRB of a cluster were 
assigned by the analysis. The fraction of correct classification 
is 0.954 which implies that the classification is indeed robust. 

2.2. Dirichlet process model based clustering 

The standard approach to model-based clustering analysis, 
is based on modeling by finite mixture of p arametric distri- 
butions. For example, Mukh eriee et alj d!998l) used such a 
model based approach to analyze GRB data where they as- 
sumed that the GRB population consists of mixture of mul- 
tivariate Gaussian classes. The number of classes is, how- 
ever, determined from an initial classification method (e.g. 
via agglomerative hierarchical clustering). The Dirichlet pro- 
cess model based clustering is more general and avoids the 
assumption of known number of possible classes. Since this 
method is less commonly used as compared to the K-means 
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technique, we describe here the basic concept on the analysis 
in more detail. 

The Dirichlet process avoids a prior assumption of 
the number of classes by applying a Bayesian non- 
parametric modeling of the unknown distribution for 
the multi component data. In this particular case 
the six component GRB data can be represented by 

Xi = (logr 50 ,logr 9 o,log/ ; V,logf , 256,log//321,log//32)',/ = 

1,2, ... ,«. More specifically, x, is assumed to follow a mul- 
tivariate normal distribution whose mean ve ctor is generated 
from a Dirichlet Process (DP). Following Escober & West! 
(1 19951) . the method is best conceptualized by representing the 
model as 



X;| W ,E~MVN( W ,E), i=l,2, 
Hi\G~G; G~DP(aG Q ) 



,«; 



(1) 



where MVN means multivariate normal distribution, G is a 
discrete measure of the unknown distribution, a is the pre- 
cision parameter and Go is a known base measure distribu- 
tion. Since G is discrete, there can be ties among the /i,'s 
, which can also be seen from polya urn representation of 
iBlackwell & MacOueen] d 1973b as 



i-i 



fjti\m,fja,...,(a-i -t— tG + -t—t T. s(p h ) (2) 

a + i—l a + i—1 

where S(x) is the distribution concentrated at the single point 
x. It is evident from Eqn. <j2j that /i, are marginally sam- 
pled from Go with positive probability and that some of the 
Hi's are identical. Thus, a partition of S = {1,2, ... ,«} can 
be formed by defining classes under the relation that /x, be- 
longs to the j th class if and only if /i,- = fijj = 1,2, 
k being the number of distinct /x,'s , i — 1,2,..., re. This in- 
duces a certain posterior distribution of S and a posterior 
inference can then be used to provide clustering procedure. 
There are various algorithms available to obtain the poste- 
rior partitions of S which are useful for making inferences 
on clustering of X\,X2, ■ ■ ■ ,X n . We implement the indepen- 
dent and identically distri buted Weighted Chinese Res taurant 
(iidWCR) algorithm (see Ish waran & Takaharal 12002) which 
comes from its use of the partition distribution of S. Let 
p = {Ci,C2, • • • ,C„(p)} be a partition of size n(p) of S, where 
each Ci contains ej elements. Assuming Go as the multivari- 
ate normal with mean vector m and covariance matrix Bq and 

denoting N p (x;fj,,E) = (27r)~7 \Y,\~t. exp[-\{x- p)'Yr l {x- p)] 
as the density of a p-component multivariate normal distribu- 
tion, the iidWCR algorithm for inducing posterior partition of 
S consists of following steps: 

Step 1: Assign p\ = {1} and the corresponding importance 
weight A(l) = N(,(x\;m,Y>o+Bo) where Eo is the initial 
estimate of E. 



TABLE 3 

Average Cluster properties based on the Dirichlet 
Mixture Modeling method 



Parameters 



Cluster I 



Cluster II 



Cluster III 



T 50 (sec) 

Tgo (sec) 

P 25 6 (#/cm 2 /sec) 

Ft (x 10~ 6 ergs/sec) 

#32 

#321 



0.31±0.02 
0.45± 0.03 
1.66± 0.08 
0.89 ±0.06 
4.68± 0.22 
2.75±0.13 



6.76± 0.31 
19.05± 0.88 
1.35±0.03 
3.46 ±0.08 
2.82± 0.06 
1.58± 0.04 



16.22± 1.50 
43.65± 3.02 
4.79± 0.33 
18.2±0.2 
3.31±0.15 
1.86±0.09 



NOTE. — Errors quoted are standard errors. The number of mem- 
bers are 409, 892 and 293 for Clusters I, II and III respectively. 



Running step 1 followed by step r for r = 2,3,... ,n gives a 
draw from posterior partition of S. This n-step draw, in fact, 
provides an iid sample from WCR density given by 



g(p) = 



f(x\p)n(p) 
MP) 



(3) 



where ir(p) is the prior density of p, f(.) is the density of 
x and A(p) = A(l) x A(2) x ... x A(n) is the importance 
weight. Repeating the above algorithm B times, one can ob- 
tain p ,p 2 , . . . ,p B iid sample observations from posterior par- 
tition of S. Based on these sample observations, Monte Carlo 
method can be devised to estimate E{n(p)}, the expected 
number of clusters, as 



E{n(p)} : 



Eti "(/>*) 
iLiA(^) 



(4) 



The key advantages of using this Dirichlet process model- 
based clustering are that the underlying distribution of x,'s and 
the number of clusters are unknown. Moreover, one can pro- 
vide an estimate of the expected number of clusters by using 
Eqn. ®. 

For fitting the model, we used a = 1.0 and a fiat prior 
G - N 6 (0,a 2 l) with a 1 = 1000. The initial value of E, 
Eq is obtained as the sample covariance matrix. We applied 
the iidWCR algorithm for B = 1000 to obtain the estimate of 
the number of clusters. Three classes were obtained consis- 
tent with the results from the K-means technique. In Table [3] 
the mean values of the parameters with errors are tabulated. 
The values are consistent with those found from the K-means 
method. The number of members of cluster II, 892 is some- 
what larger than what was found by the K-means method, 
622, but considering the different nature and approach of the 
two techniques, such differences are perhaps expected. In 
summary, these two independent clustering techniques indi- 
cate that there are three classes of GRB with qualitatively sim- 
ilar properties. 



Step r: Given p r -\, compute E r _i from x\,X2, ■ ■ ■ ,x r -\. Cre- 
ate p r by assigning label r to a new set with 
probability ( a+r _°j )A(r) x N 6 (x r ;m 7 S r _ t + B Q ). Other- 
wise, assign label r to an existing set Cf r _i with 
probability ( 'J5~\ xM X 7Y 6 (x r ; fij^i , S j^i ) where 



(V+ey.r-iE;^)- 1 and /x ;> _, = E ; >_i {Bq + 
). Note that e, r _i andx 7 r _i are the num- 
ber of elements and observed mean in C / r _i respec- 
tively and A(r) is the normalizing constant. 



3. CLASSIFICATION OF GRB WITH KNOWN RED-SHIFT 

Although the classification described in the previous sec- 
tion is based on six GRB parameters the segregation of the 
classes can be visualized using the total fiuence, Ft and the 
duration, 790. This is illustrated in Fig (O which shows Fj 
versus for the members of the three clusters obtained 
using the K-means technique. The Dirichlet process model 
gives qualitatively similar results. The solid line representing 
Tgo = 2 sec, differentiates the members of Cluster I (marked 
by triangles) with those of Cluster II (marked by filled circles). 
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T 9o ( sec ) T 80 (sec) 



FIG. 2. — The fluence Fj in ergs/cm 2 versus the duration 790 in seconds for 
members of Cluster I (triangles), II (solid circles) and III (open circles) from 
the K-means clustering method. The solid lines represent Tgo = 2s and Fj = 
1.6 x 10^/790 ergs/cm 2 which qualitatively separate the three groups. The 
solid squares represent eight GRB detected by BATSE for which redshifts are 
also measured (e.g. Bagoly et al. 20031). 



Thus Cluster I is consistent with the standard classification of 
short duration bursts. The standard long duration bursts (with 
790 > 2 s) are further classified into two groups with one of 
them (members of Cluster III, marked using open circles) hav- 
ing typically higher fluence. Thus we have named members of 
Cluster II and III as low and high fluence GRB. The solid line 
representing Ft — 1.6 x 10~ 4 /79o ergs/cm 2 qualitatively sepa- 
rates two groups. There are eight GRB de tected by BATSE fo r 
which there are redshift estimates (e.g. iBagoly et al.ll2003l) . 
These are marked by squares in Figure 2. Six of them are in 
Cluster III while two are close to the demarking line. One of 
these GRB (980425) is at a low redshift (z = 0.0085) and is 
associated with a supernova. 

To identify a GRB with a known red-shift as a member 
of a cluster, broad band coverage of the prompt emission is 
required in order to correctly estimate the total fluence Ft, 
which BATSE would have observed for the burst. This is par- 
ticularly important, when the peak of the energy spectrum of 
a GRB is at high energies > 300 keV. lAmati et all d2002l) an- 
alyzed GRB with known red-shifts and well constraned spec- 
tral parameters over a broad energy range and discovered that 
the intrinsic (i.e. red-shift corrected) peak of the energy spec- 
trum, E p correlates with the isotropic energy output, £, so . 
Apart from being a stringent condition and test for any the- 
oretical model that describes the GRB prompt emission, this 
empirical relation highlights the possibility that GRB can be 
used to prob e and constrain the exp ansion of the Universe at 
early times. iGhirlanda et alj (|2004) added more GRB to the 
sample and found that the the beaming corrected luminosity, 
E has a tighter correlation with E p than the isotropic one. 
Nevertheless, there is still significant dispersion in the rela- 
tionship which needs to be explained. In order to see how the 



FIG. 3. — The fluence Fj in ergs/cm 2 versus the duratio n 790 in seconds 
for 21 GRB (filled circles) with known redshifts taken from Ghirla nda et all 
1 2004) and references therein. The solid lines represent 7*90 = 2s and Fj = 
1.6 x 10^/790 which qualitatively distinguish the three clusters (Fig|5J. For 
each GRB, the dotted lines represent the predicted fluence and duration if 
it was located at a redshift range 0.1 < z. < 5. The open triangles represent 
the predicted fluence and duration if the GRB were located at redshift z — 1 . 
These predicted values allow for the classification of the GRB into members 
of Cluster II and III. 



classification found in this work affec ts such relationship, we 
have used 21 GRB listed in Table 1 of lGhirlanda et alj ( 120041) 
that have well measured temporal and spectral parameters. 
The total fluence Ft versus the duration Tgo for these GRB are 
plotted in Fig. (0 (filled circles). Overlaid on the plot are the 
two solid lines Ft = 1.6 x 10~ 4 /79o ergs and 790 = 2s which 
qualitatively segregate the three clusters (Fig|2|. Most of the 
GRB have high fluence and are of long duration (consistent 
with them being members of Cluster III) which is probably a 
selection effect. Two of the GRB have F T < 1.6 x 10" 4 /r 90 
ergs/cm 2 and are probably members of Cluster II. However, 
there are three GRB with Fj- ~ 1.6x NT 4 /Tgo ergs/cm 2 and 
hence there is an ambiguity about their classification. For 
each GRB, a corresponding dotted line is plotted in Fig (0 
which shows the variation of the observed Ft versus 790 if 
the same GRB was located at different red-shifts. The lines 
are drawn for a red-shift range 0.1 < z < 5.0 for a A CDM 
cosmology with fl\ = 0.7 and Hubble parameter H = 65 
km/s/Mpc. It is interesting to note that the red-shift trajec- 
tories for the high fluence GRB in general do not cross the 
Ft = 1.6 x 10~ 4 /79o line. In other words, if these GRB were 
located at a wide range of red-shifts, their observed fluence 
would have been Ft > 1.6 x 10~ 4 /rc,o and hence they would 
have been classified as members of Cluster III. On the other 
hand there are six GRB whose red-shift trajectories mostly 
lie below the de-marking line. Their observed fluence would 
be Ft < 1.6 X 10~ 4 /r9o for a wide range of red-shifts and 
hence they would be classified as members of Cluster EL The 
open triangles in the Figure mark the positions of the GRBs 
if they were all located at a red-shift, z = 1.0. In this rep- 
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FIG. 4. — The intrinsic (red-shift corrected) peak of the energy flux, Ep,,^ 
versus th e isotropic energy output for 21 GRB wi th well defined spectral pa- 
rameters ( Amati et al. 2002; Ghirlanda et al. 2004). GRB that are members of 
Cluster II (triangles) have isotropic energy output of nearly 1 52 ergs, while 
members of Cluster III (circles) have a much wider range of energy output. 



resentation, the GRB are more clearly segregated into Clus- 
ters II and III with five of them having a predicted fluence 
less than Ft > 1.6 X 10~ 4 /r9o with the others being signifi- 
cantly brighter. This strongly suggests that the classification 
described in this work is not due to observational bias arising 
from the use of observed parameters instead of intrinsic ones. 

We classify the 21 GRB according to their predicted ob- 
served fluence and duration if they were situated at z = 1 . In 
this scheme, five GRBs are classified as members of Cluster 
II while the remaining 18 are identified as members of Clus- 
ter III. Fig shows the variation of the intrinsic energy peak 
Epeak versus the isotropic energy real i zed, E j so , which is the 
correlation discovered by Am ati et ail (12002). GRB identified 
as Cluster I are marked as triangles while those belonging to 
Cluster III are represented by filled circles. GRB belonging to 
Cluster II all have an isotropic energy output close to ~ 10 52 
ergs, while those belonging to Cluster III span a much larger 
range of energy 10 52-54 ergs and roughly follow the E p —Ei s0 
correlation. Although the number of GRB in this sample 
is small, this segregation of the GRB in intrinsic parameter 
space is a another indication that the classification is robust 
and perhaps not due to observational bias. Most of the mem- 
bers of Cluster III have a rest frame peak energy E pea k > 300 
keV and hence would have had significant flux in the highest 
energy channel of BATSE, i.e. F4. Thus, the retention of F4 
in the classification analysis of §2 is important even though 
there is systematic uncertainty in the measured value of the 
fluence. Indeed, if F4 is not taken into account the evidence 
for three clusters in the BATSE sample decreases. 

4. SUMMARY AND DISCUSSION 

Two multivariate clustering techniques, the K-means par- 
titioning method and the Dirichlet process of mixture mod- 



eling, have been applied for the first time to the BATSE 
Gamma-ray burst (GRB) catalog. These two schemes do 
not make any a priori assumptions about the number of clus- 
ters, but instead provide quantitative estimate of the optimal 
number of groups. The jump curve for the K-means par- 
titioning method suggests that this optimal number is three 
which is further supported with the value of the expected num- 
ber of clusters, E{n(p)}, obtained using Dirichlet process of 
mixture modeling. The two techniques group the GRBs in 
qualitatively similar classes, which can be described as short 
bursts (790 < 2 s, Cluster I), long duration, low fluence bursts 
(Ft < 1.6 x 10~ 4 /r9o ergs/cm 2 , Cluster II) and long duration, 
high fluence bursts (Ft > 1.6 x 10~ 4 /r9o ergs/cm 2 , Cluster 
III). 

To estimate how such a classification, based on observed 
spectral and temporal parameters, can arise from intrinsic 
GRB properties, a sample of 21 GRB with known red-shifts 
and well constrained spectral parameters, are classified within 
this scheme. The observed total fluence, Ft and duration Tgo 
for these GRBs if they were located at different red-shifts 
(0.1 < z < 5) were estimated. For 16 of the 21 GRB, the 
estimated fluence would have satisfied Ft > 1.6 x 10~ 4 /79o 
ergs/cm 2 for nearly the entire range of red-shift space and 
hence they were classified as high fluence bursts. This in- 
variance in red-shift, indicates that the classification scheme 
is not strongly effected by observational bias and by the use 
of observed parameters instead of intrinsic ones. For five 
GRB, classified as low fluence bursts, the predicted fluence 
F T < 1.6 X lO^/Jgo for a significant fraction of the red-shift 
space, which again signifies the physical nature of the classifi- 
cation. Based on the classification of these GRB with known 
red-shift, it can be inferred that the low fluence GRB have a 
nearly constant isotropic Energy output of 10 52 ergs and have 
an intrinsic (red-shift corrected) duration of Tgo ~ 2-30 sees. 
On the other hand, the high fluence GRB (Cluster III) have a 
much wider range of isotropic energy output 10 52-54 ergs and 
a corresponding wide range of intrinsic durations 10 - 500 
sees. 

We note with caution that the number of GRB with known 
red-shifts, used for this analysis is small and a much larger 
sample is required before concrete conclusions can be drawn. 
It is also important, for this analysis, to have well constrained 
spectral parameters of these GRB. In particular, the peak of 
the energy fluxes, E p are required to be well estimated and 
since for high energetic sources Ep > 300 keV, it is impera- 
tive to have well calibrated high energy information. Indeed, 
if the highest energy channel of the BATSE measurement is 
not taken into account, the significance of the classification is 
smaller. 

The classification presented here, needs to be supported 
by theoretical considerations. It is tempting to identify the 
low fluence GR B with neutron star-white dwarf mergers 
dKing et al.ll2007l) and the higher fluences ones with massive 
stellar collapse. The near constancy of the isotropic energy 
output of low fluence bursts, seem to be consistent with them 
being neutron star-white dwarf mergers. Since both neutron 
stars and white dwarfs do not have significant mass variations, 
their initial conditions for the binary merger could be simi- 
lar, leading to the nearly constant energy output. Moreover, 
their merger time may also be typically smaller than mas- 
sive stellar collapse time-scales, which is consistent with the 
shorter intrinsic duration 2 - 30 s, found in this work. On the 
other hand, the energy output and duration of GRB induced by 



7 



massive stellar collapse may depend on the mass and size of 
the progenitor which is consistent with the variation inferred 
for high fiuence bursts. The present observational evidence 
for such a model is not clear. Evidence for supernova light 
curves have been detected in GRB with different energy out- 
put, including some low luminosity ones, e.g. GRB 031 1203, 
E ~ 3 x 10 4? ergs dMalesaniet al.ll2004l) , the nearby GRB 
060614, for which no supernova was detected also had a low 
isotropic energy output of 10 51 ergs. Such low energy output 
are not represented in the 21 GRB with well constrained spec- 
tral parameters used in this analysis which all have energies 
> 10 52 ergs. These rare GRB (since they have to be located 
relatively nearby to be detected) may not represent a signifi- 
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cant fraction of the BATSE catalogue. BATSE did detect GRB 
980425 which is at low redshift (z = 0.0086) and is associated 
with a supernova. At this redshift, the GRB would be a bor- 
derline case between the high and low fiuence GRB (Figure 
2). Thus the interpretation of the two different classes of long 
bursts as being due to stellar collapse and white dwarf-neutron 
star mergers, is speculative and more quantitative theoretical 
predictions and observational evidences are required before a 
definite conclusion can be made. 
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